A Precise Distance Metric for Mixed Data Clustering using Chi-square Statistics
نویسندگان
چکیده
منابع مشابه
On the multi _ chi-square tests and their data complexity
Chi-square tests are generally used for distinguishing purposes; however when they are combined to simultaneously test several independent variables, extra notation is required. In this study, the chi-square statistics in some previous works is revealed to be computed half of its real value. Therefore, the notion of Multi _ Chi-square tests is formulated to avoid possible future confusions. In ...
متن کاملLearning a Mahalanobis distance metric for data clustering and classification
Article history: Received 7 October 2007 Received in revised form 27 February 2008 Accepted 16 May 2008
متن کاملScalable Chi-Square Distance versus Conventional Statistical Distance for Process Monitoring with Uncorrelated Data Variables
Multivariate statistical process control charts are often used for process monitoring to detect out-of-control anomalies. However, multivariate control charts based on conventional statistical distance measures, such as the one used in the Hotelling’s T 2 control chart, cannot scale up to large amounts of complex process data, e.g. data with a large number of variables and a high rate of data s...
متن کاملMaximally selected chi-square statistics for ordinal variables.
The association between a binary variable Y and a variable X having an at least ordinal measurement scale might be examined by selecting a cutpoint in the range of X and then performing an association test for the obtained 2 x 2 contingency table using the chi-square statistic. The distribution of the maximally selected chi-square statistic (i.e. the maximal chi-square statistic over all possib...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Research Journal of Applied Sciences, Engineering and Technology
سال: 2015
ISSN: 2040-7459,2040-7467
DOI: 10.19026/rjaset.10.1846